Incremental enrolment of speech recognizers

نویسندگان

  • Chafic Mokbel
  • Olivier Collin
چکیده

Classical adaptation approaches generally allow a reliably trained model to match a particular condition. In this paper, we define an incremental version of the segmental-EM algorithm. This method permits to incrementally enrich a model first trained with limited amount of data. Resource memory constraints allow only the initial data statistics to be stored. The proposed method uses these statistics by fixing, within the segmental EM algorithm applied on both initial and new data, the initial optimal paths in the model for the initial data. We proved theoretically that this is equivalent to the segmental MAP adaptation with specific choice of priors. Experimented on two speaker dependent telephone databases, the approach permitted to incrementally integrate new conditions of use. The performance was slightly less than that obtained with classical training over the whole data. As expected with the MAP interpretation of the algorithm, initial data characteristics influence largely the model evolution.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Incremental Semantic Models for Continuous Context-Sensitive Speech Recognition∗

Context-sensitive speech recognizers use environment or discourse information to influence language model probabilities used in speech decoding. This is usually done by switching language models between utterances. This paper explores the use of a continuously context-sensitive language model that uses incremental interpretation to update context at every time step in decoding. Because it only ...

متن کامل

On-line incremental adaptation for speaker verification using maximum likelihood estimates of CDHMM parameters

This papers investigates two approaches to on-line incremental adaptation of CDHMM parameters. First the popular MAP approach is examined, highlighting di culties in automatically setting the adaptation rate. To overcome these problems we introduce a new approach based on the multi-observation estimation equations of the forward-backward algorithm called a cumulative likelihood estimate (CLE). ...

متن کامل

Combining forward-based and backward-based decoders for improved speech recognition performance

Combining outputs of speech recognizers is a known way of increasing speech recognition performance. The ROVER approach handles efficiently such combinations. In this paper we show that the best performance is not achieved by combining the outputs of the best set of recognizers, but rather by combining outputs of recognizers that rely on different processing components, and in particular on a d...

متن کامل

Using Articulatory Knowledge in Automatic Speech Recognition

Over the years different types of speech recognizers have been proposed and tested. During the last decade (or maybe even longer) hidden Markov models (HMMs) seem to have a better performance than other types of speech recognizers, like e.g. rule-based speech recognizers. This state of affairs has led to a gap between speech technology on the one hand, and phonetics and phonology on the other. ...

متن کامل

Language identification using acoustic log-likelihoods of syllable-like units

Automatic spoken language identification (LID) is the task of identifying the language from a short utterance of the speech signal uttered by an unknown speaker. The most successful approach to LID uses phone recognizers of several languages in parallel [Zissman, M.A., 1996. Comparison of four approaches to automatic language identification of telephone speech. IEEE Trans. Speech Audio Process....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999